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(3) Arithmetic operation apparatus for elementary function. 



© An arithmetic operation apparatus performs an arithmetic operation of a differentiable elementary 
function including a sine function sin(x), a cosine function cos(x), arc tangent function arctan(x), an 
exponential function e*. a logarithm function log # (x), an inverted number 1/x, a square root vx, and an 
inverted number of a square root 1/Vx. The apparatus comprises a divider receiving an initial value (x) 
for dividing the initial value into a more significant digit portion H and a less significant digit portion L (x 
« H + L), and aW-word memories of "k" banks receiving the more significant digit portion H of the initial 
value (x) and storing a value obtained by multiplying the more significant digit portion H by a previously 
calculated constant b 0 = f(H) or b k = (1/k I) x f *> (H) (where k is posffive integer). An arithmetic circuit 
composed of a multiplier and an adder receives the less significant digit portion L of the initial value (x) 
and an output of the memories for executing the following polynomial : 
f(x) = bo + L x {bi + L x • • • (bfc., + b k x L} •) 
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ARITHMETIC OPERATION APPARATUS FOR ELEMENTARY FUNCTION 

Background of the Invention 
Field of the invention 

The present invention relates to an arithmetic operation apparatus for elementary functions in computers.. 
Description of related art 

10 Computers for handling numerical 4ata has4o have numerical functions; particularly elementary functions 

including a sine function sin(x), a cosine function cos(x). arc tangent function arctan(x j, an exponential functi n 
ex, a logarithm function log,(x), an inverted number 1/x, and a square root Vx. As a manner for calculating these 
elementary functions, the following algorithms have been known : > 

(1) approximate value method 

15 This method utilizes a polynomial approximation such as Chebyshev expansion, Taylor expansion, and 

others, and a rational formula approximation such a continued-fraction expansion, Which have been ordinarily 
used for obtaining an elementary function. However, Which of the expansions should be selected dependency 
upon a function to be obtained. In addition, jin the case of calculation of the elementary function of a floating 
point number in a doubled precision (precision of 52 digits in a binary number), multiplication and addition must 

20 be repeated more than terf times, and therefore, along arithmetic operation time is required. 

(2) CORDIC method u : ^ ! 

This is used for obtaining a trigonometric function and an inverted trigonometric function. This can unitarily 
calculate a plurality of elementary functions. 1 However, for calculation of the elementary function In a doubled 
precision (precision of 52 dfgitsjin a binary dumber), addition/subtraction and shift must be executed 52 x 3 
25 times, and therefore, a long arithmetic operation time is required. j : 

(3) STL method '••*'.•• : • j > : | .; ; - ^ ; 

This Is used for an exponential function arid a logarithm function: However, for calculation of the elementary 
function in a doubled precision (precision of 52 digits in a binary number), addition/subtraction and shift must 
be executed 52 x 2 times, and therefore, a long arithmetic operation time; is required. 
30 (4) Newton method ......... ^. . . . 

This can be used only for a square root and a cube root, and a calculation equation should be selected 
dependent!/ upon a function to be obtained. An initial value is given in the form of an approximate value, and 
addition/subtraction/multiplication/division is repeated until a required degree of precision is obtained. If the ini- 
tial value has a relatively high precision, the repetition can be completed one! or two times. For calculation of 
35 the elementary function in a doubled precision (precision of 52 digits in a binary number), the method can be 
realized with one time of reading of a table ROM (read only memory) and three to six times of addition/sub- 
traction and multiplication. Therefore, a required arithmetic operation time is relatively short 

In any of the above mentioned conventional methods, it is rare that the same algorithm can be appli d to 
a full range of argument Therefore, it is an ordinary practice to convert the argument so that calculation is exeo- 
40 uted in a limited range. 

For example, In the case of calculation of sin 9. it is assumed that 9 = px«/4 + x (where p is integer), and 
sin(x) and cos(x) are calculated in a range of 0 ^ x < n/4. A procedure of limiting a range of argument as in this 
method is called an "argument reduction". 

As seen from the above description, the above mentioned elementary function arithmetic operation 
45 methods are disadvantageous in the following points : 
(1) The arithmetic operation time is long. 

Multiplication and addition must be repeated more than ten times in the approximate value method, and 
shift and addition/subtraction must be repeated more than ten times in the CORDIC method and the STL 
method. Therefore, the arithmetic operation time is very long, 
so (2) A plurality f elementary functions cannot unitarily be handled. 

An optimum approximate value is different if the elementary functi n is different, and therefore, th 
approximate value method cannot unitarily process a plurality of elementary functions. On the other hand, th 
STL method and the Newton method are limited in elementary functions which can be treated. The CORDIC 
method can unitarily handle the trigonometric function and the inverted trigonometric function, but cannot treat 
55 the exponential function and th logarithm function. 
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Summary of the Inv ntion 

5 Accordingly, it is an object of the present invention to provide an elementary function arithmetic operation 

which has overcome the above mentioned defect of the conventional one. 

Another object of the present invention is to provide an elementary function arithmetic operation having a 
table for an elementary function f(x) and previously calculated numbers |W (x)/k I which are obtained by multi- 
plying a k-order differentiation of f(H) with a predetermined constant, the elementary function arithmetic oper- 

w ation capable of quickly and precisely obtaining the elementary function arithmetic operation f(x) by a one-time 
searching of the table, "k" times of multiplication and "k" times of addition (where k is an integer in a rang of 
1 to 6 inclusive) 

The above and other objects of the present invention are achieved in accordance with the present invention 
by an arithmetic operation apparatus for arithmetically operating a differentiable elementary function including 

is at least one of a sine function sin(x), a cosine function cos(x), arc tangent function arctan(x), an exponential 
function e* f a logarithm function log e (x), an inverted number 1/x, a square root Vx, and an inverted number f 
a square root 1/Vx, comprising dividing means receiving an initial value (x) for dividing the initial value into a 
more significant digit portion H and a less significant digit portion L (x = H + L), 2«^word memories of "k" banks 
storing previously calculated constants bo = f(H) orb k = (1/k !) x fW(H) (where k is positive integer) which are 

20 obtained by multiplying a k-order differentiation of f(H) by given constants, each of the memories receiving as 
an address the more significant digit portion H of the initial value (x) for outputting one of stored previously cal- 
culated constants designated by the address, and at least one arithmetic circuit composed of a multiplier and 
an adder and receiving the less significant digit portion L of the initial value (x) and an output of the memories 
for executing the following polynomial : 

25 fW-bg + Lx^ + Lx - -(bin +b k xL}--.) , 

In one embodiment, assuming k = 3, the elementary function f(x) can be obtained with precision of 4n digits 
by three times of multiplication and three times of addition. In another embodiment, assuming k = 3, the elemen- 
tary function f(x) can be obtained with precision of 2n digits by one time of multiplication and one time of addition. 
Now, a principle of the elementary function arithmetic operation in accordance with the present invention 

30 wfll be described. 

Assume that the elementary function f(x) is infinitely drfferentiable in a range of a given argument. The 
elementary function f(x) can be developed into a Taylor series/Assuming x = a ( the Taylor development can 
be expressed as follows : 
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f(a+8) «f(a) + f (1) (a)xS/l! 

+ f (2) (a) x S/2! 
+ f (3) (a)x5/3! 



f (k) (a) x5/k! (1) 



This Taylor development is disadvantageous in that if an absolute value of 8 is large, an error becomes 
large, and the order number Tc" of the approximation equation becomes large. On the other hand, if the absolute 
value of 6 is made very small, the precision of the arithmetic operation can be elevated, and the order number 
"k* of the approximation equation can be reduced to an range of 1 to 5. 
so The elementary function arithmetic operation apparatus in accordance with the present invention is inten- 

ded to make the range of 6 as narrow as possible and to fall the order number "k" to a possible extent, by pre- 
viously calculating values of f(a),fH)(a), fW(a), .„* *>(a) for as many initial numbers "a" as possible in the equation 

Here, consider the case in which an elementary function f(x) is sought in a binary number of 4n digits, from 
55 a given number V represented by a binary number of 4n digits (0 £ x< 1 ). 

First, the given number V is divided into a more significant digit portion H and a less significant digit portion 
L by units of "n" digits. H and L correspond to "a" and "6" of the equation (1), respectively. 
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4n 

-Z {x k x 2" k } « H + L (2) 

k=l ...... 



where xk = {lv 0} - 



H = I {x k x 2' k } 7 (4) 



L = S n {x k x 2: k ) = , 7 (5) 



Here, putting a = H and S = Lin the equation (1), since L < 2* and L 4 < 2^, the term of L 4 and subsequent 
terms can be neglected or Ignored. Therefore^ the equation (1) can be obtained with sufficient precision by limi- 
ting as k = 3. Namely, "\ 
30 'f(x)«l(H)Hl«KH)x^ 

If this equation (6) is optimized tn order to minimize the number of multiplication operations, the following 
equation (7) can be obtained ; 

f(x) = f(H)+Lx^i)(H)+U{(1/2)fW(H^ 

In this equation (7), if there is prepared a table ROM which receives H as an input and outputs the following : 



bo = f(H) 

bk = f (k) (H)/k ! (8 ) 



(where k= 1, 2, 3) 

the foBowing equation (9) can be obtained : 

f(x) = bo+Lx{b,+Lx(b 2 +Lxb s )} (9) 
Therefore, f(x) can be obtained with three multiplication operations and three addition operations. 
45 Assuming that f(x) has no singular point within a d esignated range of V, the value of b k will never become 

large. In addition, since L<2*, \J<2** % and L><2- 3n , the constant table for the equation (9) is sufficient if it has 
4n digits for b 0 , 3n digits for b 1t 2n digits for b 2 , and n digits for b 3 . 

Therefore, in order to obtain a value of the elementary function f(x) with precision of 4n digits, only a table 
ROM of (4n+3n+2n+ri) digits x 2" words, multipliers and adders are required. Here, since the precision of mul- 
eo tiplication of b 3 xL can be limited to "n B digits, some of the required multipliers and adders can have a reduced 
precision. In other words, all the required multipliers and adders are not required to have the same degre of 
precision of 4n digits. 

In addition, some elementary function f(x) has an argument range of 1 ^ x 2. However, since an integer 
portion of "x" Is fixed to "1", the table ROM is sufficient if it has 2 n words; 
55 The above and other objects, features and advantages of the present invention will be apparent from the 
following description of preferred embodiments of the invention with reference to the accompanying drawings. 

Brief Description of the Drawings 

Figure 1 is a bl ck diagram of an embodiment of the elementary function arithmetic operation apparatus 
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in accordance with the present invention ; and 

Figure 2 is a block diagram of another mbodiment f the elementary function arithmetic operation 
s apparatus in accordance with the present invention. 

Description of the Preferred embodiments 

Referring to Figure 1 1 there is shown a block diag ram of an embodiment of the elementary function arithme- 
10 tic operation apparatus !n accordance with the present invention. The shown apparatus includes a register 1 
for receiving and holding an input variable V. This register 1 has a capacity of 4n bit or more. More significant 
VT bits of a decimal fraction portion of the variable V held in the register 1 are read out as a H signal 2, and 
least significant "3n" bits of the variable V* held in the register 1 are read out as a L signal 3. More significant 
"2n" bits of the least significant "3n" bits are outputted as a L2 signal 4, and more significant "n" bits of the least 
15 significant "3n H bits are outputted as a L1 signal 5. 

In addition, the shown apparatus includes a ROM (read only memory) 6 of 4n bits x 2* 1 words for holding 
values of f(H), a ROM 7 of 3n bits x 2° words for holding values of f 1 >(H), a ROM 8 of 2n bits x 2 n words for 
holding values of ffR)(H) / 2, and a ROM 9 of n bits x 2 n words for holding values of f 3 >(H) / 6. Furthermore, the 
shown apparatus includes a multiplier 10 of "n" bits x "n" bits receiving the L1 signal 5 and an output of the 
20 ROM 9 for the purpose of calculating a product of the L1 signal 5 and the output of the ROM 9, and an adder 

1 1 of 2n bits for adding an output of the ROM 8 with a data which is obtained by rightward shifting an output 
of the multiplier 10 by "n" bits. Art output of the adder 1 1 and the L2 signal 4 are supplied to another multiplier 

12 of "2n" bits x "2n" bits, which in turn outputs a product of the output of the adder 1 1 and the L2 signal 4. An 
output of the multiplier 12 is rightward shifted by "n" bits, and inputted to ona input of another adder 13 of 3n 

25 bits, which has the other input connected to receive an output of the ROM 7. An output of the adder 13 and the 
L signal 3 ara supplied to a third multiplier 14 of "3n* bits x 3 3n" bits, which in turn outputs a product of the 
output of the adder 13 and the L signal 3. An output of the multiplier 14 is rightward shifted by "n N bits, and 
inputted to one input of a third adder 1 5 of 4n bits, which has the other input connected to receive an output of 
the ROM 6, so that the n-bit rightwarrj-shifted output of the multiplier 14 and the output of the ROM 6 are added. 

30 An output of the adder 15 is connected to an output register 16 of 4n bits. 

The following table shows various elementary functions f(x) which can calculated by the apparatus shown 
in Figure 1 and which therefore are to be held in the ROM 6, coefficients to be held in the ROMs 7, 8 and 9, 
and the range of the argument *x". 
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Table 1 



uefncnuiry 
Function 
f(x) 


1st - Order 
Differentiation 
f< l > (x) - 


2nd - Order 
Defferentiatidn/2! 
f<2) (x)/2 


3rd - Order 
DifFerentiation/3 ! 

; ifto (x)/6; 


Range of 
Argument 


sin (x) 




//•:;.^sin'(xy2-\.;;' 


/ - cos (x)/6 \. 


QSx* tv/4 


cos(x) 


- sin (x) 


- cos (x)/2 


sin (x)/6 




arc tan (x) 




" . -x 


3x 2 - i 


<Kx<l 


. <X 2 + I) 


: (X 2 + l) 2 ■ 


3(x 2 -l) 3 


c x 


■ ,,,-eX : • 


- t % a 


• e x /6 


0£x£l6g c 2 


log„(x) 








i 


1/x 


-l/x* 




-1/x 4 ^ ' 


l<x<2 


YT 


1 

2/x" 


-1 

8x/x" 


1 

16x 2 Vx" 


1<X<4 


1/Vx" 


-1 
2x /x" 


3 

8x 2 /x" 


-5 
16x 3 vr 


l£x<S4 



For example, if the elementary function f(x) to be obtained is sin(x). it could be understood from the table 
1 that sin(H) is stored in the ROM 6. cos(H) is stored in the ROM 7 and — sin(H)A2 and — cos{H)/6 are stored 
in the ROMs 8 and 9, respectively. 

In the apparatus shown in Figure 1. if the argument V is inputted and registered in the input register 1, 
the H signal 2. which is composed of the more significant "n" bits of the decimal fraction portion of the argument 
V held In the register 1, is supplied to the ROMs 6 to 0 as an address. On the other hand, the multiplier 10 
executes multiplication of th L1 signal 5 and a value of fW(H>/6 read out of the ROM 9. A valu of LxfW(H)/6 
outputted from the multiplier 10 is rightward shifted by "n" bits for d cimal point matching, and is added with a 
value of fW(H)/2 read from th ROM 8, by means of the adder 1 1 . Th multiplier 12 multiplies an output value 
of the adder 1 1 by the L2 signal 4. An output of the multiplier 1 2 is rightward shifted by "if bits for decimal point 
matching, and is added with a value of fl 1 >(H) read from the ROM 7, by means of the adder 13. The multiplier 
14 multipii s an output value of the adder 13 by the L signal 3. An output of the multiplier 14 is rightward shifted 
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by "n" bits for decimal point matching, and is added with a value of t(H) read from the ROM 6, by means of the 
adder 15. 

5 As seen from the above, the arithmetic operation of the elementary function f(x) requires only a total times 

of an access time for the table ROMs and a time required for the three multiplication operations and the three 
addition operations. 

In the case of obtaining a value of a given elementary function f(x) with precision of 52 bits (n=1 3), 

ROM 6 has a memoty capacity of 52 bits x 8132 Words ; 
10 ROM 7 has a memory capacity of 39 bits x 6192 words ; 

ROM 8 has a memory capacity of 26 bits x 8192 words ; 

ROM 9 has a memory capacity of 13 bits x 8192 words ; : 

Multiplier 10 is a multiplier of 13 bits x 13 bits ; 

Multiplier 12 is a multiplier of 26 bits x 26 bits ; 
15 Multiplier 14 is a multiplier of 39 bits x 39 bits; 

Adder 11 is an adder of 26 bits ; 

Adder 13 is an adder of 39 bits ; and 

Adder 15 is an adder of 52 bits. 

In the above case, a total memory capacity of the ROMs is 1,064,960 bits, which can be realized in the 
20 form of an LSI according to a recent semiconductor technique. 

Furthermore, ff each of the multipliers 10, 12 and 14 Is constituted of a carry save adder and a carry propa- 
gate adder, the amount of hardware and the time of arithmetic operation can be effectively reduced by con- 
stituting each of the multipliers 10, 12 and 14 and the adders 11,13 and 15 by the carry save adder and by 
putting carry propagate adder after the adder 15. 
25 Turning to Figure 2, there is shown a block diagram of another embodiment of the elementary function 

arithmetic operation apparatus in accordance with the present invention. The first embodiment has a high deg- 
ree of precision for the purpose of a doubled precision floating point data (such as 52 digits in a binary notation). 
However, the second embodiment is for a single precision floating point data (such as 24 digits in a binary nota- 
■ tion). Specifically, the first embodiment has the precision of 4n digits under k=3 in the equation (3), and on the 
30 other hand, the second embodiment has the precision of 2n digits under k=1 . 

In the second embodiment therefore, the equation (9) can be expressed as follows : 

fto-bo+Lxbi (10) 

Therefore, f(x) can be obtained with one multiplication operation and one addition operation. In addition, 
since H<1 and L<2- ft , the constant table for the equation (10) is sufficient if it has 2n digits for b 0 , and "n" digits 

35 for by , 

The second embodiment includes a register 1a of 2n bits for receiving and holding an input variable V. 
More significant "n" bits of a decimal fraction portion of the variable *x* held in trie register 1a are read out as 
a H signal 2, and least significant "n" bits of the variable "x" held in the register 1 are read out as a L signal 3a. 
In addition, the shown apparatus includes a ROM 6a of 2n bits x 2 n words for holding values of f(H), and a ROM 

40 7a of n bits x 2 n words for holding values of ft 1 >(H) t which is a first-order differentiation of f(H). Furthermore, the 
shown apparatus includes a multiplier 10 of "n" bits x "n" bits receiving the L signal 3a and an output of the 
ROM 7a for the purpose of calculating a product of the L signal 3a and the output of the ROM 7a, and an adder 
11 of 2n bits for adding an output of the ROM 6a with a data which is obtained by rightward shifting an output 
of the multiplier 10 by "n* bits. An output of the adder 6a is connected to an output register 16a of 2n bits. 

45 If the argument "x* is inputted and registered in the input register 1 a, the H signal 2, which is composed of 

the more significant "n" bits of the decimal fraction portion of the argument V held in the register 1a, is supplied 
to the ROMs 6a and 7a as an address. On the other hand, the multiplier 10 executes multiplication of the less 
significant "n" bits of the argument "x" (the L signal 3a) and a value of fl 1 >(H) read out of the ROM 7a. A value 
of Lxf< 1 >(H) outputted from the multiplier 10 is rightward shifted by "n" bits for decimal point matching, and is 

so added with a value of f(H) read from the ROM 6a, by means of the adder 1 1 . As the result, the adder 1 1 outputs 
f(x)~f(H) + UftO(H). 

As seen from the above, the arithmetic operation of the elementary function f(x) requires only a total times 
of an access time for the table ROMs and a time required for the one multiplication peration and the one addi- 
tion operation. 

55 In the case of obtaining a value of a grv n elementary function f(x) with precision of 24 bits (n=12), the ROM 

6a has a memory capacity of 24 bits x 4096 words ; the ROM 7a has a memory capacity of 12 bits x 4096 words; 
the multiplier 10 Is a multiplier of 12 bits x 12 bits; and the adder 11 is an adder of 24 bits. 

Therefore, a total memory capacity of the ROMs is 147,456 bits. Therefore, in order to realize the eight 
elementary functions shown in the table 1, a memory capacity of 1,179,648 bits is required, which can be 
realized in the form of an LSI. 
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As seen from the above, the elementary function arithmetic operation apparatus in accordance with the 
present invention is advantageous in the following tw points : 

(1) Th arithmetic operation time is short. 

In the first embodiment of t the elementary function arithmetic operationapparatus, the required processing 
time is a total tine of the reading time of the table ROM, a triple t of the multiplication time and a triple of the 
addition time. In the second embodiment the required processing time is a total time of the wading time of the 
table ROM, the multiplication time and the addition time, ): , ;:y 3 . . T . : . ; ; 

Now. assuming that the reading time : of the tebla ROM is 0.20. its, ihe jnultiplication time is 0.20 (is, and 
the addition time is 0.05 jis, the require^ processing time Is 0.95 fis in the [first embodiment and 0.45 \is in the 

second embodiment ; , : ,- 3 -o-- Mi.;; v ". v:----.; ■;* 

(2) A plurality of elementary functions. if<(x) pari b$ unitarily, handled. t t . \ 

The elementary function arithmetic operation apparatus In .s^^iance. ; WHh invention can 

change over the elementary function f(x) by changingdate stored in. the table' ROM. Therefore, "rf table ROMs 
corresponding to the eight kinds of elementary functions shown in the table 1, one of the eight kinds of elemen- 
tary functions, sin(x), cos(x), arctan(x), e* log # (x)i 1/x, Vx, and 1/vx can be selectively calculated. 

The invention has thus been shown and described with reference to the specific embodiments. However, 
ft should be noted that the present invention is in no way limited to the details of the illustrated structures but 
changes and modifications may be ..made yrithin the scope of the appended claims. 

Claims . \^ r \ ^ - ; j.-. : r; r - ; ! .r/ -c 

1. An arithmetic operation apparatus for arithmetically operating a differentiable elementary function compris- 
ing dividing means receiving an, initial value (x) for dividing said, initial value into a more significant digit 
portion H and a l^ss^nm<^pf digit portion L (x = H + L), 2*rWor*i jrr^mqries of -k" banks storing previously 
calculated constants bp ^'*[H).^.^?-(?* !) * f^^yXwhere k is positive integer) which are obtained by 
multiplying a k-ordet differentiator given constants, each of said memories receiving as an 
address said moip t significant , djgltp of stored previously, 
calculated constants designated by said address, an<£ an arittimetic ..circuit composed of at least one mul- 
tiplier and at least one adder and receiving said less significant digit portion L of said initial, value (x) and 
an output of said memories for executing the following polynomial : 

2. An arithmetic operation apparatus claimed in Claim 1 wherein said k is 3 aril said arithmetic circuit is com- 
posed of three multipliers and tfiree adders so thata value of the elementary function can be obtained with 
precision of 4n digits. \ : ' v >v v .. . • \ m .^ i}U >*\- f{ ;v ■• "-\ r^ \ 

3. An arithmetic operation apparatus claimed in Claim i wherein said k is 3 and said arithmetic circuit is com- 
- posed of one multiplier and one adder so that a value of the elementary function can be obtained with pre- 
cision of 2n digits. 

4. An arithmetic operation apparatus claimed in Claim 1 wherein said elementary function includes at least 
one of a sine function sin(x), a cosine function cos(x). arc tangent function arctan(x) , an exponential function 
e\ a logarithm function log a (x) t an inverted number 1/x, a square root Vx, and an inverted number of a 
square root 1/>/x. 
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FIGURE 2 
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